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ABSTRACT 

The European Southern Observatory's (ESO) Visible and Infrared Survey Tele- 
scope for Astronomy (VISTA) is a 4-m class survey telescope for wide-field near- 
infrared imaging. VISTA is currently running a suite of six public surveys, which 
will shortly deliver their first Europe wide public data releases to ESO. The VISTA 
Kilo-degree Infrared Galaxy Survey (VIKING) forms a natural intermediate between 
current wide shallow, and deeper more concentrated surveys, by targeting two patches 
totalling 1500 deg^ in the northern and southern hemispheres with measured 5tT lim- 
iting depths of Z ~ 22.4, Y ~ 21.4, J ~ 20.9, H ~ 19.9 and Ks ~ 19.3 (Vega). This 
architecture forms an ideal working parameter space for the discovery of a significant 
sample of 6.5 < z < 7.5 quasars. In the first data release priority has been placed 
on small areas encompassing a number of fields well sampled at many wavelengths, 
thereby optimising science gains and synergy whilst ensuring a timely release of the 
first products. For rare object searches e.g. high-z quasars, this policy is not ideal since 
photometric selection strategies generally evolve considerably with the acquisition of 
data. Without a reasonably representative data set sampling many directions on the 
sky it is not clear how a rare object search can be conducted in a highly complete and 
efficient manner. 

In this paper, we alleviate this problem by supplementing initial data with a re- 
alistic model of the spatial, luminosity and colour distributions of sources known to 
heavily contaminate photometric quasar selection spaces, namely dwarf stars of spec- 
tral type M, L and T. We use this model along with a subset of available data to 
investigate contamination of quasar selection space by cool stars and galaxies and lay 
down a set of benchmark selection constraints that limit contamination to reason- 
able levels whilst maintaining high completeness as a function of both magnitude and 
redshift. We review recent follow-up imaging of the first VIKING high-z quasar can- 
didates and find that the results lend considerable support for the choice of selection 
constraints. The methods outlined here are also applicable to rare object searches in 
a number of other ongoing and forthcoming projects. 

Key words: surveys - techniques: photometric - galaxies: high-redshift - quasars: 
general 
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1 INTRODUCTION 

The existence of luminous quasars at z ~ ff^ suggests that 
the formation of the first supermassive black holes began 

^ There are a number of variables which arc conventionally de- 
noted z or Z. To avoid confusion they are defined here as z; red- 
shift, z; the SDSS passband, Z; the VISTA passband. 
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within the first few hundred million years of the big bang. 
As such high-z quasars offer powerful probes of the early 
Universe, setting constraints on early st r ucture forrnation 
llStiavelh et al.ll2005l : IWillott et aLll2005al : IShen et alJlioOTl . 
etc.), chemical enrichment (|Maiolino| l2'00g, and refs. therein) 
and the state of th e inter-galactic medium at the end of 
cosmic re-ionisation jFan et al.ll2006l ). 

In the last decade, o ptical surveys suc h as the Sloan Dig- 
ital Sky Survey (SPSS: lYork et all bOOol ) an d the Canada 
France High-z Quasar Survey (CFHQS; IWillott et al] 
l2005bl ) have increased the number of known quasars at z ~ 6 
to over 50 fe.g. lJiang et al ] |2008l : IWillott et al.|[201ol ). At the 
single scan sensitivity limit of the Sloan camera {z ~ 21.0) 
the surface density of z > 6 quasars brighter than 2: < 21 is 
of order pq ~ 0.01 deg~^, so this represents a considerable 
effort. 

High-z quasars are generally initially selected for follow- 
up observations by virtue of their photometric colours. The 
key to the photometric detection of a strong quasar candi- 
date is to isolate the characteristic flux break blueward of the 
Lyman-alpha (Ly-a) transition brought about by the thick- 
ening of the Ly-Q forest towards higher redshifts. Colour 
selection techniques exploit this break by placing a blue ob- 
serving band across the Ly-Q transition and a further band 
just redward. Objects suitable for follow-up spectroscopy 
should then present themselves as extremely red faint point 
sources. A carefully placed colour cut is usually sufficient 
to confine the most interesting sources to a region of colour 
space distinct from most main sequence stars and galaxies. 
In principle, this facilitates candidate selection in just two 
passbands, but in practice a third passband is necessary to 
permit a measurement of the continuum level redward of 
Ly-a as a means of reducing contamination from cool de- 
generate stars. This is the technique pioneered by the SDSS 
and adopted in the CFHQS; both make initial selections in 
i — z and take follow-up J-band imaging o f promising can- 
didates prior to spec troscopic confirmation (|Fan et al.ll200ll : 
IWillott et al]|2005bl ). 

At z > 6 Ly-a begins to shift out of the Sloan 2 band 
and while projects like the Panoramic S urvey Telescope and 
Rapid Response System (Pan-STARRS: 'Kaise r et al.ll2002l ') 
aim to progress towards higher redshifts in a predominantly 
optical parameter space (by selecting i dropouts), faint opti- 
cal detections make it difficult to reject the numerous Galac- 
tic stars with scattered quasar-like colours that can out num- 
ber high-z quasars by a factor 10*. 

To this end a number of other surveys have taken a 
different approach, employing the near-IR as their selec- 
tion space, a tactic which has had recent encouraging suc- 
cess in the UKIRT Infrared Deep Sky Survey (UKIDSS; 
iLawrence et al.ll2007f ). One of the innovations behind the 
UKIDSS photometric system, was to recognise that near- 
IR filter combinations incorporating z, J, H and K alone 
are inadequate for z > 6 quasar selection, since cool star 
spectra tend to pea k in the near-IR and cro wd the quasar 
colour-colour locus l|Warren fc Hewettll2003 '). The solution 
was to introduce an observing band interme d iate b etween z 
and J and was put forward bv lHewett et al. I (|2006t ). The Y- 
band filter was optimised for this purpose and has been used 
to great effect in the UKIDS S-LAS (UKIDSS Large Area 
Survey; ILawrence et al.ll2007l ) where seven z > 6 quasars 
have been identified up to the UKIDSS eighth data release 



JVeiiemans et al.ll2007l : iMortlock et al.ll2009l : iMortlock et all 
l201ll . Venemans et. al. in preparation, Patel et al. in prepa- 
ration). 

Working in the same parameter space as UKIDSS is 
the European Southern Observatory's (ESO) Visible and In- 
frared Survey Telescope for Astronomy (VISTA), a 4-m class 
wide field survey telescope located at ESO's P aranal obser- 
vatory in Chile (|Emerson fc Sutherland! bOld l. More than 
80 per cent of VISTA time is devoted to running a suite of 
six public surveys, which will deliver European public data 
releases in the near future. Of these projects, the VISTA 
Kilo-degree Infrared Galaxy survey (VIKING) has a com- 
bination of depth and area that is ideally suited for quasar 
searches at the highest redshifts. Over the nominal five year 
survey life span, VIKING will image a total of 1500 deg'^ in 
the Z, Y, J, H, Ks near-IR bandpasses centred on the north- 
ern Galactic cap and the southern Galactic pole. VIKING 
will detect in excess of 6 x lO" stars and 20 x 10® galax- 
ies down to a limiting 5(t detection limit of J ~ 20.9 (Vega 
system) . 

VISTA's transmission curves are shown in Figure [T] 
These include the relevant contributions from the instru- 
ment and detector, mirror refiectivity and transmissions 
through the atmosphere and filters under typical photomet- 
ric conditions . Also plotted is th e Large Bright Quasar Sur- 
vey 's rLBQS:lHewett et aLliToOsh template quasar spectrum 
(|Francis et allliggiT ). redshifted to z = 6.8 and with the char- 
acteristic HI absorption trough artificially imposed blueward 
of redshifted Ly-a at A (Ly-a) = 1216 A x 7.8. The spectra 
of cool M8- and T5-dwarf stars are plotted alongside the 
quasar spectrum to highlight, the recognised difficulty of 
differentiating between high-z quasars and cool stars at the 
foot of the main sequence and below the hydrogen burning 
cut-off. 

M-dwarfs out-number high- z quas ars by at least a fac- 
tor ~ 10^ (e.g. iBochanski et al.l (|2010l ). obtained a volume 
hmited sample of ~ 15 x 10® SDSS objects with M-dwarf 
like colours at z < 21.2. Over a similar depth and area 
there are currently 19 SDSS quasars with z ~ 6 reported 
by Fan et. al. 2006.), while the cooler L an d T spectral 
types are by all accounts, a lot more rare (jLodieu et al.l 
I2OO9I : iBurningham et al.ll201()l '). The broad similarities be- 
tween high-z quasar and M-dwarf spectra, coupled with their 
relative surface densities means that most colour selected 
high-z quasar candidates are in fact M-dwarfs with scattered 
quasar like colours resulting from large photometric errors. 
The function of a good colour selection criterion then, is to 
maintain a large selection space while simultaneously min- 
imising contamination from false positives and optimising 
the completeness of the search. 

In the vein of other wide field imaging surveys before 
it, the initial VIKING data release consists of a few high 
priority fields to provide a testbed for future larger releases. 
In the first public VIKING data release ~ 75 deg'^ will be 
made available. Precision selection strategies can be devel- 
oped with a detailed understanding of the error perturbed 
magnitude and colour distributions of the target and con- 
taminant populations, both as a function of redshift and 
direction on the sky. These strategies will no doubt develop 
with the accumulation of large amounts of data. However, 
the first photometric follow-up of VIKING candidates was 
undertaken in summer 2011. Although by this time there 
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was ~ 350 deg^ of data available, there was insufficient time 
to analyse it in detail other than to select candidates based 
on selection constraints derived from smaller initial data 
sets. Thus, to optimise selection constraints and provide a 
complete, efficient and non-biased search it was necessary 
to u se a combi nation of initial data and detailed modelling 
(e.g. lFan|[l999l '). 

The following article describes a model of the magni- 
tude, colour and spatial distributions of the cool star and 
quasar populations, which was developed and assessed along 
with ~ 200 deg^ of initial imaging data to lay down some 
baseline magnitude and colour selection criteria to apply to 
the ~ 350 deg^ available for the first follow-up observations. 

The paper is organised as follows; In Section[2]VIKING 
Vega system synthetic photometry is computed for typical 
examples of cool-stars and quasars. In Section [3] offsets from 
the Vega system are measured in the VIKING data. In Sec- 
tion|4]the synthetic photometry is employed along with num- 
ber count models to populate a synthetic realisation of the 
cool-star and high-z quasar component of the VIKING cat- 
alogue. In Section [5] the catalogue and available data are 
used to place constraints on quasar selection space and as- 
sess completeness. In Section [5] we assess the selection crite- 
rion in light of recent foUowup observations. We summarise 
our findings and outline future work in Section [T] 

Henceforth, all quoted photometry is defined on the 
Vega system, such that Vega is a zero magnitude star in all 
VISTA passbands. Magnitudes are quoted in the traditional 
logarithmic form except fo r those in the Z ban d, which are 
quoted in the asinh form l|Lupton et al.l 0*9991 ). asinh mag- 
nitudes take the same values as the equivalent logarithmic 
form for high signal-to-noise (S/N) measurements, but have 
the advantage of linear dependence at low S/N or even neg- 
ative fiux, which will be the case for most z > 6.5 quasars. A 
softening parameter denoted b, is required to define the scale 
at which linear dependence dominates. For the Z band we 
choose b equivalent to Z = 24.1, which gives the asinh mag- 
nitude of an object with zero flux and is set approximately 
equal to the sky noise seen in the initial data frames. 

Throughout this work we adopt the following values for 
the Hubble constant and the matter and cosmological con- 
stant energy density parameters; Ho = 70 km s~^ Mpc~^, 
Om = 0.28 and 0,a ~ 0.72 respectively (|Komatsu et al.l 
l2009l ). 



2 Z, Y, J SYNTHETIC PHOTOMETRY AND 
CHARACTERISTIC COLOURS 

Characteristic colours were calculated for a set of cool stars 
and quasars by integrating theproduct of their spectra and 
the VISTA response functiono including measured values 
for the wavelength-dependent losses resulting from atmo- 
spheric extinction, mirror reflectivity and instrument and 
detector response. 



http: / / www.eso.org/ sci / facilities /par anal /instruments /vista/inst / 



2.1 High-redshift quasars 

In order to obtain a representative spread of quasars in 
colour-colour space, we use a s ample of 386 quasars from 
the SDSS Quasar Catalogue IV (jScimeider et al.ll2007l '). The 
sample spans a redshift range of 3.1 < z < 3.2, over which 
there is low stellar c ontamination and high completeness 
(Ric hards et al.ll2002l '). 

Recent comparisons between high and low-z quasar 
spectra have shown no obvious evidence for metalicity evo- 
lution in the rest - frame UV spectra l energy distributions 
(| Jiang et al.ll2007l : I Juarez et al1l2009l : iMortlock et"al]|201ll . 
although see Jiang et al. 2010 for possible evolution of dust 
properties). Thus, it is a reasonable approximation to simu- 
late intrinsic quasar spectral energy distributions at z > 6.5 
by artificially redshifting the low-z SDSS sample. 

Significant differences between z > 6.5 quasars and 
those at lower redshift are apparent blueward of Ly-Q due to 
increasingly strong absorption in the intergalactic medium 
with redshift. At z = 6.5 the optical depth of the intergalac - 
tic medium to Ly-Q photons is ^ 1 (e.g. iFan et alll2006l V 
For the same neutral density the optical depths to Ly-/3 and 
Ly-7 are factors of 6.2 and 17.9 less than for that of Ly-a, 
but their line strengths are co mparatively small a nd they can 
be reasonably neglected (e.g. IWiese et al.lll96^ . Transmis- 
sion blueward of A = 1216 A (1 -I- z) is thus effectively nil, 
and absorption by intergalactic hydrogen can be adequately 
modelled in z > 6.5 quasar spectra by setting transmission 
shortward of redshifted Ly-a to zero. 



2.2 Stars 

A library of cool star spectra has been collected from the 
literature covering VISTA's Z, Y, J, H and Kg passbands, 
these include all 21 M-, 3 L- and 22 T-dwarf spectra on 
which iHewett et al.l (|2006| ) conducted synthetic photometry 
in the WFCAM passbands. References for these objects can 
be found in the above paper. The library is further supple- 
mented by spectra obtained from the DwarfArchiveqj, the 
IRTF (NASA Infrared Telescope Facility) spectral librarjQ, 
the SpeX Prism spectral librarjjf], the Keck LRIS (Low Res- 
olution Imaging Spectrometer) spectral librarj|f| and Sandy 
Leggett's L- and T-dwarf archive^ 



2.3 Selection in Z, Y, J 

The resulting synthetic colours have been checked for con- 
sistency by convolving stellar sources in the WFCAM pass- 
bands and com paring results with those presented for the 
same sources bv lHewett et al.l (|2006l ') finding good agreement 
between the two. There is also broad a greement between 
these results and ob servational work (e.g. Zhang et al.ll200^ : 
iLeggett et"ai]|2O10D . Simulated i — z colours of the quasar 
sample over the SDSS passbands are also consistent with 
the corresponding SDSS photometry. 

At this stage it is useful to review the general approach 



^ http:/ /spider. ipac.caltech.edu/staff/davy/ARCHIVE/links.shtml 

* http: / /irtfweb. ifa.hawaii.edu/~spex/IRTF_Spectral_Library/ 

^ http: / /pono. ucsd.edu/ ~adam/browndwarfs/spexprism/ 

^ http:/ /www. stsci.edu/~inr/ultracool. html 

^ http: / /staff. gemim.edu/~sleggett/LTdata. html 
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Figure 1. Transmission curves for the VISTA Z, Y, J, H, Ks photometric system, including wavelength dependent losses are labelled and 
shaded grey. Also shown is the LBQS template quasar spectrum artificially redshifted to z = 6.8 and with the characteristic absorption 
trough imposed blueward of redshifted Ly-a (solid black curve). For comparison the spectra of cool M8- and T5-dwarfs are also plotted 
(dash-dot and dashed curves respectively). 



to near-IR high-z quasar selection in visual form. In Figure[5] 
a Z, Y, J colour-colour diagram is presented with the derived 
synthetic photometry. The quasar tracks are colour coded 
according to their redshift, in steps of Az = 0.01 and the av- 
erage track is labelled at various redshifts for clarity. The box 
bounded by the dashed Une illustrates the general principle; 
the box includes almost all 6.5 < z < 7.5 quasars, rejecting 
galax ies as shown by the El locus ( Rowan- Robinson et al.l 
1200^ and almost all foreground stars. 

The position of each quasar in Z — Y is a function of 
redshift, reflecting the movement of the Ly-a break through 
the Z-band. The Y — J colours have only minor dependence 
on redshift until z = 6.9 where the locus begins to turn over 
as Ly-a enters the observed Y band. Conversely, the distri- 
bution of stellar colours is driven largely by temperature, 
which acts to redden spectral types approaching early-L. 
In later types, behaviour is more complicated as clouds act 
to mask spectral features and redden near-IR colours (see 
lKirkpatrickll2005l ). With the appearance of pressure broad- 
ened Nal and KI doublets in the reddest optical bands at 
mid to late L, the sequence is forced redward in Z — Y into 
the T sequence. 

The simple cuts outlined above will need to be modi- 
fied for a real survey, since random photometric errors will 
broaden the measured locus, and thus can scatter stars and 
galaxies into the quasar selection region. Since all stars and 
galaxies outnumber genuine high-z quasars by a factor > 10® 
(J < 21), a realistic survey will need to ensure that the cuts 
are sufficiently far from the stellar locus to reject almost all 
stars and galaxies. In the reminder of this article we examine 
how best to place these cuts such that they isolate the vast 
majority of high-z quasars without selecting unacceptable 
numbers of contaminants. 



Quasar redshift 
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Figure 2. The Z, Y, J colour plane; high-z quasar redshift evolu- 
tion tracks are colour coded as indicated by the colour bar. Star 
and galaxy colours are indicated by the legend. The dashed line 
describes the general principles of quasar colour selection in the 
Z, Y, J passbands, enclosing the vast majority of high-z quasars 
while rejecting the vast majority of galaxies and foreground stars. 



3 Z, Y OFFSETS FROM THE VEGA SYSTEM 

VISTA photometry is calibrated with refer ence to a set of 
2MA SS (the Two Micron All-Sky Survey; ISkrutskie et al.l 
l2006t ) standards, which have been measured with reference 
to the spectrophotometric standard star Vega (a— Lyrae). 
This implies that Vega is a zeroth magnitude star in all 
VISTA bandpasses. The calibration involves measuring the 



offsets between the 2MASS standards and VISTA observed 
stars via a set of colour equations. Since 2MASS photom- 
etry was measured in the J, H, Ks bands alone the basic 
assumption is that the 2MASS colour equations can be lin- 
early extrapolated to cover the VISTA Z and Y bandpasses. 
Any divergence from linearity will result in an offset from 
the Vega system in the extrapolated wavele ngth region. 

A similar occurrence was noted by iHodgkin et al.l 

(|2009l ) in UKIDSS data release 2, where a rigorous analysis 
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of the Z, Y, J, H and K photometry against the overlapping 
SDSS footprint found a significant offset in the Y-band. A 
similar analysis for VISTA will no doubt be attempted when 
the catalogues are diverse enough to make precise measure- 
ments. However, since our quasar selection criterion is to be 
partly based on Vega zeroed synthetic photometry, it is im- 
portant to at least tentatively determine any offsets present 
before we begin. 

VIK ING data is processe d by the VISTA Data Flow 
Syst em (Emerson et al.l I2OO4I ) in its pipeline (jlrwin et al.l 
2004) and retrieved from the VISTA Science Archiv^fl (VSA; 
Hamblv et al.ll2004l ). At the time of this analysis the VSA 
contains the latest vl.O VIKING release which comprises of 
some 200 deg^ of imaging in Z, Y, J and 120 deg^ of imaging 
in all five bands; 90deg^ of this has complementary SDSS 
overlap. Off sets in this data set a re measured following the 
approach of iHodgkin eTall (|2009l '). the main points of which 
are summarised here. The reader should refer to the original 
paper for a full description including the maximum likeli- 
hood fitting procedures referred to below. 

A sub-sample of high signal-to- noise (S/N > 10.0) 
point-like sources detected in both VIKING and the SDSS 
was defined and those sources with Vega like SDSS colours 
i.e. AO stars with u — g,g — r,r — i and i — z in the range 
-0.1 to 0.1, were flagged. Figure [3] plots the J, H and Ks 
colour-colour diagram for this set of sources. Blue points 
show the photometrically selected sample of AO stars. The 
small number of these objects highlights the difficulty of 
investigating the VIKING colours at this early stage. The 
median colours of this sample are J — H = 0.005 ± 0.038 
and H — Ks = 0.033 ± 0.023 (where the median absolute de- 
viation has been used to estimate the standard deviation), 
which perhaps hints at small offsets in one or both of H/Ks. 
Given that each of these colours has been directly calibrated 
from 2MASS standards, we continue our analysis assuming 
that any offset in these passbands is small. As we will show 
shortly, there is evidence that this assumption is a good one. 
Also plotted in Figure [3] are red crosses, which show the 
synthetic colours of the stellar sequence derived from the 
Bruzual-Persson-Gunn -Stryker (BPGS) spectroscopic atlas 
l|Gunn fc Strvkeilll983l ') in Section 1. The synthetic locus is 
in overall good agreement with the locus of bright stars in 
this colour-colour space. 

Following Hodgkin et al., we now extend the baselines 
on each Z, Y, J, H and Ks colour-colour combination in 
one direction by plotting all possibilities against u — Ks. 
Offsets from zero are then measured from straight line fits 
to the data. To clarify this procedure an example of the 
fitting in the Z, J, u and Ks colour plane is shown in Fig- 
ure |4l Due to limit ed statistics our ap proach to the fitting 
differs slightly from lHodekin et al.l |20P9) who bin their data 
and take measurement errors based on standard deviations 
on binned median values. Conversely our approach is to fit 
to the photometrically selected AO stars and retrieve er- 
rors directly from the maximum likelihood fitting procedure. 

® The VSA holds the image and catalogue data products gen- 
erated by the VISTA Infrared Camera (VIRCAM). The pri- 
mary contents of the archive originates from the VISTA Pub- 
lic Surveys. The archive can be queried via the Structured 
Query Language (SQL) and several interactive web forms at 
|http://horus.roe.ac.uk/vsa/index.html| 




H-Ks 

Figure 3. The VIKING J, H, Ks colour plane for high confidence 
stellar sources matched in VIKING and SDSS. Blue points are 
stars with Vega like colours in all SDSS passband combinations. 
Red c rosses are synthetic photometry from BPGS spectroscopic 
atlas dGunn fc Strykcr..l983i) . 
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u-Ks 

Figure 4. Maximum likelihood fit to stars with Vega like colours 
in SDSS with counterparts in VIKING. Similar fits were made in 
all other VIKING colour combinations vs. m-Ks, the results are 
summarised in Table [T] and in the main text. 



The results show appreciable offsets in the Z and Y bands 
only. These are summarised in Table [1] The correspond- 
ing offsets found in the J, H and Ks bands are consistent 
with zero; A,th = 0.006 ± 0.004, Ajks = 0.008 ± 0.005, 
Ahks = 0.013 ±0.006. 

A further two results are shown in Table [T] these were 
obtained by fitting to the blue end of the stellar locus from 
all the bright stars in the original selection. Again a straight 
line maximum likelihood fit is applied in the Z, J, Ks and 
Y, J, Ks colour-colour spaces and the offsets from the ori- 
gins are measured. The best fitting line to the Z, J, Ks 
stellar locus is shown in Figure [5] and the measured off- 
sets in both the Z, J, Ks and Y, J, Ks colour spaces are 
summarised in Table [T] In these cases the measured offsets 
are slightly larger than those measured in the VISTA-SDSS 
colour-colour spaces. These may well be attributed to small 
offsets in the J or Ks passbands as tentatively suggested by 
FigureO Individual offsets to the J, H and Ks passbands im- 
plied by these fits are clearly smaller than those attributed 
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Figure 5. The VIKING Z, J, Ks colour plane for high confidence 
stellar sources matched in VIKING and SDSS. A sample of blue 
stars is plotted with filled circles, obvious outliers were clipped 
from the sample and are shown in red. The blue line shows the 
maximum likelihood straight line fit to this distribution, while 
the red line shows the fit to the main sample. The Z band offset 
is measured from the blue star intercept to the origin. 





Az 


Ay 


Azj, Ayj 


-0.111 ±0.002 


-0.082 ± 0.002 


AzH, Ayh 


-0.109 ±0.004 


-0.086 ± 0.004 


AzKs: AyKs 


-0.096 ± 0.006 


-0.073 ± 0.005 


AzjKs, AyJKs 


-0.146 ±0.003 


-0.135 ±0.003 


Mean 


-0.116 ±0.008 


-0.094 ± 0.007 



Table 1. Summary of Z and Y offsets from the Vega system. 

to Z and Y and will not present significant problems in tlie 
following sections. 



4 MODELLING THE VIKING SURVEY 

Having used synthetic photometry to predict the colours of 
cool stars and quasars in the VISTA Z, Y and J bandpasses, 
the next step is to combine this information with number 
count models of the cool-star and quasar populations to 
produce a simulation of the Z, Y and J band photometry 
expected from VIKING. 

4.1 Quasar number counts 

It is well established that the space density of quasars 
peaks at z ~ 2.5 and declines rapidly thereafter 
(e.g. [Richards et~al] |2006| ). By z ~ 6 there are ~ 400 
quasars brighter than z = 21.0 over the entire sky. Since 
the aim of this work is to limit our follow-up observations 
to a manageable number of objects, the specific number 
of quasars expected from VIKING is not directly relevant 
because the overwhelming majority of follow-up candidates 



will not be quasars at all. Nevertheless, the calculation 
is important for two other reasons; firstly to demonstrate 
that follow-up at this stage presents a statistically viable 
prospect of discovering a quasar and secondly because 
doing so will allow qualitative constraints to be placed on 
the luminosity function when the VIKING quasar search 
begins to bear fruit. 

The number of quasars brighter than an apparent mag- 
nitude nip in some fixed observing passband p over a given 
redshift interval may be calculated via integration of the 
quasar luminosity function (QLF) over comoving volume to 
a depth given by the absolute magnitude limit of the survey. 
The late st determinat i on of the QLF at z ~ 6 was under- 
taken bv lWillott et al.l (|2010l ') and combines discoveries from 
the CFHQS with the more luminous SDSS main and deep 
samples. The input catalogue comprises of 40 quasars sam- 
pling the redshift range 5.74 < z < 6.42. The bright end 
of the binned luminosity function is well constrained to a 
power law with some evidence for a flattening in the slope 
provided by a single quasar at lower luminosities. In keeping 
with work at lower redshifts, where the QLF is well con- 
strained over a large magnitude interva l (|Bovle et al.ll2000l : 
ICroom et al. I I2OO9I : iGlikman et al.l^2011^ ■ the Willott et al. 
parametric QLF has a double power law form. 

At z ~ 6, the space de nsity of b r ight S DSS quasars 
(Mi45D < -27) measured by iFan et all (|200ll ) is consistent 
with a sing le power law extrapo lation of the QLF between 
3 < z < 5 (|Schmidt et al.lll995h . The rate of evolution im- 
plied by these luminosity functions corresponds to a value 
k — —0.47, where the space density declines exponentially as 
^Qfc{z-6) Beyond z ~ 6 the value of k is of course completely 
unknown and moreover one cannot rule out the possibility 
of a noticeable luminosity dependence, similar to those in- 
herent in quasar, AG N and galaxy populations at lower red- 
shifts (|C owie et al.l il996 ; Bargo r et al . 2005|; [C room et al.l 
l2009t ). Clearly then, by extrapolating much past z ~ 6 one 
should only make baseline projections. 

We compute a baseline number count model adopting 
the value k — —0.47; we also compute the extreme cases 
k = (no evolution model) and k = —0.94, which cor- 
responds to a factor 3 decline in density per 0.5 increase 
in redshift. Model number count curves are shown in Fig- 
ure 1^ which also includes open circles at the inverse area 
of VIKING, the UKIDSS-LAS and the VISTA Hemisphere 
Survey (VHS) at their respective lOo" sensitivity limits i.e. 
the surface density at which the given survey would contain 
a mean of one quasar. Shaded regions indicate the change 
in number counts when the faint end slope of the QLF 
{13 = -1.5) is allowed to vary between -2.0 < P < -1.0. The 
small deviations from the initial models show that VIKING 
will probe the bright end of this specific QLF. The baseline 
model predicts ~ 8 z > 6.5 quasars brighter than the lOa 
sensitivity limit over the entire VIKING field, which would 
correspond to a total of ~2 quasars in the first 350 deg'^ of 
imaging. 

The UKIDSS eighth data release (DR8) contains some 
2600 deg^ of LAS imaging in at least the Y- and J-bands. 
The baseline number count model predicts that there should 
be a mean of two z > 6.5 quasars in this area, while the 
non-evolving model predicts ~ 3. At present there is one 
reported discovery of a z > 6.5 quasar (z ~ 7 to be a little 
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Figure 6. The number of 6.5 < z < 7.5 quasars br ighter than J, 
for thr ee models based on the luminosity function of lWillott et al.l 
1I2OIOI ). The parameter k determines the rate of density evolution 
beyond z = 6. The short-dashed, solid and long-dashed curves 
correspond to fc = (0.0, —0.47, —0.94) receptively (see text for 
details). Open circles have been placed at the inverse area of 
various surveys, showing the surface density at which the given 
survey would contain a mean of 1 quasar brighter than its lOcr 
detection limit. Shaded regions show the deviations in each curve 
when the faint end slope (/3 = —1.5) is allowed to vary between 
-2.0 < 13 < -1.0. 



more precise) in UKIDSS DR8 (|Mortlock et al.ll201ll ). which 
marginally favours the evolving models. 



4.2 Stellar number counts 

The Besancjon stellar population synthesis model of the 
Galaxy (BGM: iRobin et al.ll2003l : iRobin fc Revlell2003l ) al- 
lows stellar number counts up to spectral type M to be com- 
puted as function of J-band magnitude and direction on the 
sky via an interactive web forirlf|. The model is currently 
undergoing an update to the set of M-dwarf atmospheres it 
uses. The current study therefore provides an ideal testbed 
in which to compare their performance. Here we briefly re- 
view the model inputs and the changes to the M-dwarf at- 
mospheres. 

The atmospheres of cooler L- and T-dwarfs are much 
less well constrained and so we take a more empirical ap- 
proach to calculate their number counts. 



ing a specific treatment. The main ingredients to simulate 
those populations are the density laws, the initial mass func- 
tion (IMF) and the star formation rate. 

At the magnitudes considered here, the more relevant 
parameters to estimate counts of M-dwarf stars are the 
slope of the IMF at low masses and the scale height. Thin 
disc scale height has been obtained from a dynamical ar- 
gument; the scale height depends on the local density, the 
age-velocity dispersion relation and the potential through 
the Boltzmann equation. Hence the thin disc scale height 
is fixed for an isothermal population of a given age and a 
given vertical velocity dispersion. The different scale heights 
or ellipticities as a function of age are given in lRobin et al.l 
(2003). 

The IMF used in the simulations is different from 
the BGM on li n e, wh ich was obtained in an analysis in 
ISchuhheis et al.l (|2006l ) where the slope of the IMF in the 
thick disc has been found to be very steep. Later on this 
IMF was discarded when compared with deep data because 
it overestimated numbers of M-dwarfs in the simulations. 
This mistake was probably due to a default of the calibra- 
tion of the model in the Canada France Hawaii Legacy Sur- 
vey (CFHTLS) bands. The IMF slope has now been revised 
and we use here a slope of a = 0.7 down to very low masses, 
as in lHavwood et aTl (| 19971 ) where the IMF is parametrised 
as dN/dM oc M-'-^+°'\ 

The computations of the magnitudes and colours of 
the stars rely on the stellar atmosphere models. Despite 
of the fact that the Basel library was used for most the 
HR (Hertzsprung- Russell) diagram (|Leieune et al.|[l997l ) , for 
low mass s tars those model s significantly deviate from ob- 
servations (jSchultheis et aL 2006). Instead we use Next Gen 
model atmospheres (jAUard et al.lll997l ) at effective temper- 
atures smaller than about 4000 iC'. At the present time, the 
BGM does not include brown dwarfs of masses smaller than 
the hydrogen burning limit. 

Stellar number counts were calculated on a 2D grid of 
spectral-type and J-mag bins for 30 representative fields of 
lOdeg^ for each of the thin-disc, thick-disc and halo com- 
ponents. The results were scaled up accordingly. In order 
to make certain that density gradients were well sampled, 
effort was made to ensure that each field was offset from 
adjacent fields by no more than 5deg of Galactic latitude. 
J-band number counts were then calculated in the range 
16.0 < J < 24.0 in bin widths of AJ = 0.5. According to the 
quasar number count model this is ~ 2 mag brighter than 
we can realistically expect to recover quasars in VIKING 
and ~ 3 mag fainter than the 5a limiting magnitude of the 
survey. The extra range is included to allow sources to scat- 
ter in and out of the survey as the result of photometric 



4-2.1 The Besangon Stellar Population Synthesis Model 
of the Galaxy: M-dwarf number counts 

The Besangon Galaxy Model has been designed to produce 
simulations of the stellar content in the Galaxy as seen to- 
wards different lines of sight and photometric bands. It is 
based on assumptions about stellar evolution. Galactic evo- 
lution and Galactic dynamics. The model includes 4 popula- 
tions (thin disc, thick disc, halo and bulge) each one deserv- 



http://model.obs-besancon.fr/ 



4-2.2 L- and T- dwarf number counts 

For number counts of L- and T-dwarfs one can consider the 
standard empirical Galactic density model, whereby the to- 
tal contribution from the thin and thick discs is given by 
the sum of two double exponential profiles (e.g. IChen et al.l 
[2001:). A full de scription of t he number count calculation is 
provided by .Caballero et ahl (^2008) so here only the main 
points are reviewed. 

The space density of either disc component at Galactic 
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Table 2. Parameters of the Galact ic thin and thick disc system 
l|Chen et al.ll200ll: I Juric et al.ll2008h 



Rq Zq l h 

(pc) (pc) (pc) (pc) 

8600 ± 200 +27 ± 4 Thin: 2250 ± 1000 330 ± 3 
Thick: 3600 ± 720 580 - 750 
Thick disc normahsation: 13 - 6.5 % 



For N sources distributed over Ji...Jp...Jp magni- 
tude and Ti . . . Tq . . . Tq spectral type intervals, the proce- 
dure is to step over all P x Q bins iteratively. Each object 
receives a set of synthetic fluxes given by = (/z, /y, /j)- 
F is then generated via interpolation of our synthetic pho- 
tometry (Section 2) between a source of the relevant spectral 
type Tq and that of the later adjacent spectral type, Tq+i - If 
the r th occupant of bin p, q has a magnitude J in the range 
Jp < J < Jp+i and a spectral type Tg, then F is defined by 
the following weighted sum: 



coordinates {I, b) and heliocentric distance d assumes the 
following form 



F = aFp^q^r + {I - a) Fp, 



(3) 



i{d, I, b) oc exp 



R{d,l,b) - R(: 



exp 



\Zq + dsin&l 
H 



(1) 

where Zq is the position of the sun relative to the Galactic 
plane and H and L represent the scale height and length 
of the particular disc component. Since VIKING observes 
extra-galactic fields i.e. out of the Galactic plane, the as- 
sumption Rq ^ d is valid and the galactocentric distance 
at the point of interest R{d, l,b), is given by the approxima- 
tion 



R{d, I, b) ~ Rq — dcos b cos I. 



(2) 



The proportionality constant is the spatial density of the 
relevant object class or classes at Z = i.e. at the Galactic 
plane. 

We adopt the local space density and absolute J-band 
magn itude per spectral type estimates of ICaballcro et al.| 
(|2008 ) and calculate stellar number counts per spectral type 
in J- magnitude bins of width 0.5 over the range 16 < J < 24. 
The parameters f or th e thin and thick d iscs are taken from 
IChen et al.l (|200ll ') and lJuric et~all (120081 ) as given in Table[l 

We assume an identical luminosity function in both the 
thin and thick discs. This probably overestimates the count 
of thick-disk brown dwarfs, since old population sub-stellar 
objects are predicted to ha ve very faint mag nitudes by the- 
oretical cooling sequences (lKi rkpatric^ l2005h. This would 
place most of the thick-disk population with Ai < O.OGMq 
below VIKING detection limits. This assumption is there- 
fore conservative since it will overestimate the brown-dwarf 
contamination of the quasar sample. 

A similar argument can be made for the halo, which rep- 
resents an even older population. In this case we assume that 
all halo L- and T-dwarfs will be below VIKING detection 
limits. BGM number counts of halo M-dwarfs fall off rapidly 
towards cooler spectral types. The BGM surface density of 
halo M9-dwarfs with J < 23.0 is < 1 deg~^, suggesting that 
this is a reasonable approach. 



4.3 Generating Synthetic Observations 

With stellar number counts binned by apparent J-band mag- 
nitude and spectral type, the next step is to pair each count 
with a set of synthetic photometry to mimic an observation 
of each source. 



where a is a uniform random deviate in the range < a < 1. 
Each of the components of F (i.e. Fp,q,r and J^q,q+i,,.), 
comprise of Z-, Y- and J-band fiuxes generated by drawing 
Z- and Y-band fiuxes from our library of synthetic photom- 
etry for the relevant spectral type. These are then scaled 
to a reference flux /j, which is derived from a magnitude J 
drawn randomly from the interval Jp < J < Jp+i. This pro- 
cess is repeated over all P x Q bins until all number counts 
are coupled to a set of unique synthetic photometry. 

The VSA VIKING release vl.O currently contains 173 
image tiles. During the image reduction and pipeline pro- 
cessing the average sky noise in each image tile is measured 
by placing a number of circular apertures in sparsely pop- 
ulated regions. These measurements are then placed in the 
header units of each image before being uploaded to the 
VSA. In a real survey of faint sources the distribution of 
measured flux errors is sky dominated and usually well ap- 
proximated as being Gaussian. The intrinsic distribution of 
synthetic fiuxes can therefore be perturbed by drawing from 
a set of Gaussian distributions, with standard deviations 
equal to the sky noise level in the VIKING image tiles and 
scaled to the theoretical fiux of the object in the correspond- 
ing passband. Adding this distribution of errors to the set 
of simulated intrinsic fluxes gives a set of simulated 'obser- 
vations'. 



4.4 Generating a Synthetic Catalogue 

The catalogue extraction process for the VIKING survey, is 
handled by the VISTA Data Flow System (VDFS) at the 
Cambridge Astronomical Survey Unit (CASU). The CASU 
source extraction algorithm does not implement a rigid de- 
tection threshold. Thus to determine whether our synthetic 
sources would be detected by the CASU source extractor 
it is necessary to model the S/N distribution of detected 
so urces f r om th e data. Here we follow a novel method used 
bv lPatell (|2010D . which is described in brief below. 

Since we are modelling a population of point sources it 
is necessary to limit the data analysis to stars only, which 
requires a highly reliable method of star-galaxy separation. 
In contrast to Patcl (201Q.), who used morphological classi- 
fiers to define a reliable and complete sample of point-like 
objects, here we use colour separation by matching sources 
in VIKING, the VISTA Deep Extra-galactic Observations 
survey (VIDEO) and the CFHTLS. These surveys share an 
overlap region of ~ 1 deg^ in the VIDEO tile centred on 
Ra = 02'' 26™ 00' and dec = -04° 30' 00" (the VIDEO- 
XMM3 field). VIDEO observes in the same passbands as 
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Figure 7. The CFHTLS-VIDEO g, i, J, Ks colour plane for 
matched VIKING- VIDEO-CFHTLS sources in the ~ 1 deg^ re- 
gion of overlap shared by each survey. The deep CFHTLS-VIDEO 
observations allow robust star galaxy separation. The red dashed 
line shows the regions in which the star and galaxy populations 
are defined. 



VIKING but is ~ 2.5 mags deeper {5a limiting depths of 
Z = 25.2, Y = 24.0, J = 23.6, H = 22.6 and Ks = 21.7). 
At the time of writing only the Y, J, H and Ks observations 
are available, but this is not important since most high-z 
quasars will be too faint to be detected in VIKING Z-band 
observations given the CASU ~ 5a extraction criterion. 

The GFHTLS Deep survey observes in the Sloan pass- 
bands to limiting depths of it = 26.6, g = 28.0, r = 27.6, i = 
27.0 and z = 25.7. Sources matched between all three sur- 
veys are plotted in the CFHTLS-VIDEO g - i vs. J - Ks 
plane in Figure [7] The extra depth in the matched catalogue 
minimises photometric scatter and, with the combination of 
colours, gives remarkably good star galaxy separation. The 
red dashed line in Figure [7]separates two regions from which 
we draw a sample of stars and galaxies. Where possible i or g 
dropouts have been classified by virtue of either their 2 — J vs 
J — Ks colours or their J — Ks colours alone. In this section 
we concentrate on the stellar sample. 

Histograms of the J-band magnitude distributions of 
matched stars are shown in Figure [8] for VIKING (red) and 
VIDEO (blue) photometry. For comparison, the full distri- 
bution of VIDEO-CFHTLS matched point sources is also 
plotted (black histogram) . Synthetic sources are drawn ran- 
domly from this distribution and each source is perturbed 
by a Gaussian error with a standard deviation equal to the 
sky-noise level drawn from VIKING image tiles. Each per- 
turbed source is then given a detection significance (S/N) 
threshold drawn from a Gaussian distribution with mean /ip 
and standard deviation ap. If a given source possesses a S/N 
greater than it simulated detection threshold then it is kept 
otherwise it is discarded. The resulting distribution is then 
compared by eye to the VIKING- VIDEO-CFHTLS matched 
distribution and the process is repeated with various values 
of Hp and ap until a good match is found. The continuous 
black curve in Figure |8] shows the accepted matched J-band 



Figure 8. The J band magnitude distributions of sources 
matched between VIDEO-CFHTLS (black histogram) and 
VIKING- VIDEO-CFHTLS with VIKING photometry (red his- 
togram) and with VIDEO photometry (blue histogram). The 
black continuous curve shows the distribution of simulated 
sources with intrinsic magnitudes drawn from the VIDEO- 
CFHTLS matched catalogue and perturbed with realistic 
VIKING-like Gaussian errors. A Gaussian distributed signal to 
noise cut has been applied to mimic the source extraction process 
used to produce the VIKING catalogue (see text for details). 



distribution with the parameters (/ij, aj) = (0.22, 0.04). 
The same procedure was undertaken in the Y-band yield- 
ing a Gaussian with parameters (/xy, o"y) = (0.15, 0.06). 
The findings were applied to the set of simulated observa- 
tions described above resulting in a realisation of the cool 
star component of the VIKING catalogue over the entire 
1500 deg2 of the VIKING field. 



5 THE Z, Y, J SELECTION CRITERIA 

The magnitude and colour distribution of the simulated 
stellar population has been compared with data from the 
VIKING- VIDEO-CFHTLS catalogue. The results are shown 
in Figure |9l where we plot the real data in histogram format 
and the simulated catalogue as smooth curves. An excess 
of blue stars can be seen in the colour distributions in the 
left hand panels. Closer inspection of the magnitude distri- 
bution of these stars shows that they are reasonably faint 
with number counts peaking at Z ~ 19.5, Y ~ 19.5 and 
J ~ 19.0. Given their blue colours, they are likely intrin- 
sically bright main sequence halo stars. To promote a fair 
comparison between the true and modelled magnitude dis- 
tributions an effort has been made to remove the majority of 
the blue stars from the data by retaining only those having 
Z - Y > 0.0, Y - J > 0.0, Z - J > 0.3. 

The overall agreement between colour and magnitude 
distributions in the model and the data is good, but the 
model over predicts the number counts in some bins by up 
to a factor of 1.3. The number count predictions are therefore 
conservative. In the modelled population almost all objects 
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are M-dwarfs whose number counts and J-band magnitudes 
are predicted by the BGM. Given the rarity of cooler type 
objects, the Ideg^ patch of deep VIKING- VIDEO-CFHTLS 
matched imaging is not sufficient to constrain the L- and 
T-dwarf number count model well. The overall good agree- 
ment with the data suggests that for the latest type dwarfs, 
the modelled objects are distributed accurately over colour 
and magnitude space and that the predicted number counts 
are within an order of magnitude of their true values. It 
is also encouraging that our resul ts broadly conform with 
data from recent n ear-IR searches (|Burningham et "al]|2010l : 
iLodieu et al.ll200^ '). With a set of well placed colour cuts, L- 
and T-dwarfs are known to make only modest contributions 
to high-z quasar candidate lists in comparison to M-dwarfs, 
so the propagation of these uncertainties will not have a large 
impact on estimates of quasar colour space contamination. 

In the following section the modelled stellar and 
quasar photometry are employed along with data from 
the VIKING- VIDEO-CFHTLS overlap region to investigate 
contamination to quasar colour space and to lay down some 
benchmark selection criteria relevant to the first candidate 
follow-up run. We emphasise the use of the word 'bench- 
mark' here to mean that we will be working conservatively, 
placing tighter constraints than will perhaps be necessary. 
In a real survey it is often useful to relax constraints e.g. so 
that one can consider the errors on candidates around the 
selection cutoff boundaries. 



5.1 Contamination of quasar colour space 

Take as an illustrative example the first follow-up of 
VIKING quasar candidates, which was scheduled over three 
nights in June 2011. As a rough estimate one might expect 
this to be sufficient to make ~ 100 observations. Prior to 
this, there was expected to be ~ 300 deg^ of data available 
from which to choose candidates. This allowed us to set a 
first broad-brush selection constraint e.g. that we must limit 
our follow-up catalogue to ~ 1 candidate per 3 deg^ To 
place further, similar benchmark constraints it is necessary 
to look in more detail at contamination of quasar colour 
space. 

5.1.1 Extra-galactic Contamination 

Up to now, the focus of this work has been on contamination 
from foreground Galactic M-, L- and T-dwarfs. Referring 
back to Figure [2] it is clear that contamination from Galax- 
ies could also pose a problem. Fortunately most galaxies can 
be rejected on the basis of their extended disc-bulge mor- 
phologies. At faint magnitudes, it is possible for the galac- 
tic disc component to fall below the detection limit of the 
survey, while the bulge component remains bright. Galaxies 
detected in this way can be morphologically indistinguish- 
able from stars and usually the best way to avoid this type of 
contamination is to apply a detection significance constraint 
to ensure that candidates are sufficiently well detected in at 
least one band. 

The star galaxy separation techniques discussed in the 
previous section present the opportunity to study the sta- 
tistical properties of VIKING galaxies in detail and place 
further selection constraints on both the morphology and 



detection significance of quasar colour space contaminants. 
The resulting galaxy sample is on the whole slightly less re- 
liable than the stellar sample, since it shares a similar region 
of colour space with low-z (0.1 < z < 2.5) quasars. The lack 
of any SDSS spectroscopic overlap prevents any attempts 
to remove these interlopers with any confid ence, but work 
on optical and near-IR nu mber counts (e.g. [Richards et al.l 
I2OO6I : iMaddox et al.|[20o3 ) suggests that low-z quasars will 
be outnumbered by galaxies in this sample by at least two 
orders of magnitude and they can be reasonably neglected. 

The VSA provides a number of morphological classifiers 
based on the curve of growth of the individual detections of 
each source in a set of progressively inclusive apertures. The 
finer details of this approach are discussed extensively by Ir- 
win et al. (In preparation), the main points are summarised 
as follows; measurements are made on ellipticity and curve- 
of-growth statistics. The results are then compared at the 
detector level to the average stellar locus as a function of 
magnitude. In this way each detection is given a morpholog- 
ical classification statistic known in the VSA as the merged- 
ClassStat (MCS). For stars, this statistic is well approxi- 
mated by an A'^(0, 1) Gaussian distribution. Each individual 
classification is combined to give the overall classification for 
each source. 

At bright magnitudes stars and galaxies populate two 
separate and well defined MCS loci. At faint magnitudes 
the galactic locus becomes indistinguishable from the stellar 
locus. We add to the sample of stars and galaxies, a sam- 
ple of objects which are unmatched between VIKING and 
VIDEO and label these sources as 'noise'. Figure [TD] shows 
the completeness and contamination of the stellar sample by 
galaxies and noise within the following constraints; — 10 < 
MCS < 10 (solid curve), -7 < MCS < 7 (dotted curve), 
-5 < MCS < 5 (dashed curve) and -3.5 < MCS < 3.5 
(dash-dotted curve). It is clear that losses in completeness 
are small compared to gains in minimising contamination. 
However even with the tightest constraints contamination 
contributes to ~ 60 per cent of the sample at the faintest 
magnitudes. 

The VSA supplies a second useful morphological clas- 
sification statistic called the pClass. pClass is derived from 
a discrete statistic which allocates a classification based on 
the passband level MCS and several catches to make clas- 
sification more reliable (e.g. for saturated sources). Each 
discrete classification corresponds to a reasonably accurate, 
self-consistent probability for each of the allowed cases; star, 
galaxy, noise and saturated. For each source the pClass 
statistics for each separately available detection are then 
combined using Bayesian classification rules assuming inde- 
pendence. This defines the probability that a given source 
is either a star, galaxy, noise or saturated, the relevant at- 
tributes in the VSA being pStar, pGalaxy, pNoise and pSat- 
urated. 

The sources in our matched VIKING- VIDEO-CFHTLS 
catalogue are all fainter than J = 17.0 and so in all cases 
pSaturated = 0.0 and pStar -I- pGalaxy -I- pNoise — 1.0. 

We place the following cuts on each of the morphological 
classifiers, 

pNoise < 0.6 
pGalaxy < pStar -f 0.9 
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Figure 9. Left: VIKING Z, Y, J colour distributions of tlie simulated ttiin disc, combined thick disc and halo and total cool star 
contributions in ~ 1 deg^ of the VIKING- VIDEO-CFHTLS overlap region. Each distribution is compared to the real distribution of 
VIKING stellar photometry as shown by the histogram style plots. The excess of blue objects in each plot is attributed to main-sequence 
halo stars with bright absolute magnitudes, an effort has been made to remove these objects in order to compare modelled and true 
magnitude distributions. Right: As in the left hand panel but for the simulated and real Z, Y, J magnitude distributions. In both cases 
the the overall agreement with the data is good. There is a slight overshoot in predicted number counts making these models conservative. 



and in Figure[TO]the contamination and completeness of the 
stellar sample is plotted in grey for —3.5 < MCS < 3.5. 
With the new constraints the contamination from galaxies 
is significantly reduced with only a small reduction in com- 
pleteness as the penalty. 



Figure [TT] (a) plots all galaxies in the Z, Y, J plane and 
galaxies passing the morphological constraints are plotted 
in panel (b). Given the morphological constraints the re- 
maining galaxies in (b) are all indistinguishable from stars. 
We naturally want to select quasars as faint as possible, but 
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Figure 10. Upper: The completeness of VIKING- VIDEO- 
CFHTLS stellar sample with various morphological selection con- 
straints imposed (see legend). Lower: The contamination of the 
VIKING- VIDEO-CFHTLS stellar sample by galaxies and noise 
with various morphological selection constraints imposed. All 
morphological statistics are derived from VIKING imaging. 

experience from other surveys has shown that an Sa to 10a 
detection in at least one band is necessary t o avoid the selec - 
tion of large numbers of false positives (e.g. lFan etahlbOOlh . 
Therefore at the faint end we adopt a nominal photometric 
cut at S/N > 8 in both Y and J. Galaxies in Figure [TT] pass- 
ing this constraint are plotted in panel (c). Only a small 
fraction of the original galaxies remain and most are con- 
centrated within the main galaxy locus. There are perhaps 
half a dozen sources lying sufficiently far from the galaxy 
locus to be mistaken for potential candidates. Since this is 
now the realm of small number statistics it is impossible to 
draw strong conclusions on the frequency of these or simi- 
lar objects. Assuming that this distribution is typical for all 
VIKING galaxies, then in the worst case scenario all outliers 
from the locus would be considered for foUowup. This sug- 
gests, perhaps 10 extra-galactic contaminants per deg'^, 
although this is clearly a weak constraint. Better constraints 
can be placed on the extent of extra-galactic contamination 
when we begin to identify our candidates spectroscopically. 
For now, it is safer to work conservatively by placing a set 
of benchmark colour cuts such that all galaxies are rejected. 
These cuts are shown by the red dashed line in Figure 1111 



Figure 11. The VIKING Z, Y, J colour plane for a highly 
complete sample of galaxies selected from the VIKING- VIDEO- 
CFHTLS overlap region. Panel (a) shows all galaxies, Panel (b) 
shows remaining galaxies after morphological cuts have been im- 
posed and Panel (c) shows the sample after both morphological 
cuts and an 8cr detection significance threshold in Y and J is ap- 
plied. The red dashed line shows a conservative colour selection 
constraint, which rejects all remaining galaxies. 

tribution in redshift. Given the selection space constraints 
derived in the previous section, the task here is simple; to 
refine these criteria to allow ^ 1 interloper per 3 deg^ while 
simultaneously ensuring that the vast majority of quasars lie 
within these constraints. One also must ensure that the re- 
vised constraints do not constitute a relaxation of the above 
constraints. 

The revised selection region is described by Equation |4] 
and shown in Figure [T^] by the dashed line, which encloses 
~ 500 interloping stars. As required, this corresponds to 
~ 1 interloper per 3deg^ over the entire 1500 deg^ of the 
VIKING field. 

Z-Y > 1.375 

Y-J < 0.900 (4) 
Z-Y > 0.750 (Y-J) -Hi. 375 

Finally it is extremely important to verify whether these 
cuts maintain high completeness. 



5.1.2 Stellar Contamination 

Figure [12] plots the simulated cool-star catalogue in Z, Y, 
J colour-colour space. Stars are represented as black points 
and contours. Plotted in blue as open circles is a catalogue 
of ^ 200 simulated 6.5 < z < 7.5 quasars. Each source has 
been 'detected' at the 8 a level in Y and J and was drawn 
from larger catalogue of ~ 1000 quasars with a uniform dis- 



5.2 Selection Completeness 

The estimated completeness contours of the selection strat- 
egy are shown as a function of J and redshift in Figure [131 
Completeness is defined here as the fraction of catalogued 
quasars of intrinsic J-band magnitude J and redshift z, with 
observables that pass the morphological, significance and 
colour constraints outlined above. The total mean complete- 



© XXX RAS, MNRAS 000, [T]-?? 




-0.5 0.0 0.5 1.0 1.5 

Y - J 



Figure 12. The VIKING Z, Y, J colour plane for simulated cool 
stars (contours and points) in the VIKING catalogue. Also shown 
are ~ 200 simulated 6.5 < z < 7.5 quasars (open circles) drawn 
from a uniform distribution in redshift and 'detected' at the So- 
level in Y and J. The red dashed line defines a benchmark colour 
selection region that encloses ~ 1 object per 3 deg^ as was re- 
quired by the conditions of our follow-up campaign. 



ness of the survey is ~ 60 per cent and is robust against 
moderate changes in the selection criteria that maintain the 
^ 1 interloper per Sdeg'^ requirement. 

The conservative selection constraints imposed on the 
simulated catalogue maintain a completeness that is com- 
parable to but slightly lower than the highly successful 
SDSS and CFHTLS quasar sea rches in the optical and the 
UKIDSS search in the ne a r-IR llFan et al.ll2003l : Ijiang et all 
l2008l : IWillott et all l2010l : iPatell l2010l )~The diflference can 
be accounted for firstly by our conservative selection con- 
straints and secondly because completeness estimations in 
these searches do not consider the losses due to morphologi- 
cal rejection. That said, it is important to mention that the 
completeness will be further affected by a few additional fac- 
tors not modelled here. Firstly, loss of completeness due to 
survey 'holes' caused by ghosts around bright stars; secondly, 
blending with foreground objects causing genuine quasars to 
move bluewards in Z — Y and be rejected and finally, non- 
Gaussian errors and spurious detections causing increased 
contamination compared to our models. 

The bright-star losses are expected to be relatively 
small, rejecting typically a few percent of the sky. Sec- 
ondly, the losses from chance projection of foreground ob- 
jects are also modest; at Z ~ 23 the surface density of ob- 
jects is ~ 10 per arcmin^, so if we assume (conservatively) 
real quasars will be rejected if closer than 2 arcsec to such 
an object, this will cause an inefficiency of 3.5 per cent. 
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Figure 13. Estimated VIKING survey completeness for the 
benchmark selection constraints. Contours are plotted in J vs. 
z and increase in intervals on 0.1. The selection function accounts 
for loss of completeness from all significance, colour and morpho- 
logical constraints described in the text. 

Finally, there are many sources of non-Gaussian errors 
which may create spurious Z-dropouts, including moving ob- 
jects such as asteroids, image persistence, hot pixels, etc. 
The VIKING observing strategy ensures that each object is 
observed at approximately 8 jitter positions on different pix- 
els, and also that the J-band imaging is split into two epochs 
separated by weeks or months. Using this, we expect that 
almost all spurious, time-variable and moving objects can 
be rejected by checking of individual data frames, though 
the precise number remains to be quantified in detail. 

With a mean completeness of ~ 60 per cent, the quasar 
number count models derived in Section |31 suggest that 
2 6.5 < z < 7.5 quasars brighter than the VIKING 8a 
point-source sensitivity limit will be selected from the first 
~ 350 deg^ of VIKING imaging, by the benchmark selection 
constraints. 



6 NTT IMAGING OBSERVATIONS 

The first VIKING quasar candidates were observed in Intt 
and Zntt imaging by the ESQ Faint Object Spectrograph 
and Camera II (EFOSC2) on ESO's New Technology Tele- 
scop(|3 (NTT) over a three night period between 26th - 29th 
June 2011. In practise both trad itional colour selectio n and 
probabilistic candidate ranking ijMortlock et al.ll201ll ) were 
employed independently to converge on the final candidate 
list and as recommended in Section [5] the benchmark colour 
and significance constraints were relaxed considerably in or- 
der to remain inclusive. Here we examine the VIKING Z, Y, 

Based on observations made with ESQ Telescopes at the La 
Silla Paranal Observatory under programme ID 087.A-0655(A) 
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J photometry of these candidates and discuss imphcations 
for the benchmark selection constraints in hght of the NTT 
observations. 

In total 129 science exposures were taken of 44 candi- 
date£3- 

All candidates were observed in Intt to depths of 
> 22.8 mag at the Scr noise level. At these depths z > 6.5 
quasars are dark in optical passbands so candidates showing 
a visible flux peak in these images were discarded. In total 
just 6 candidates appeared undetected in Intt images that 
reached 3a magnitude limits of Intt > 23.7 (this has since 
been confirmed with aperture photometry). 

In Figure[l4]the dashed line shows the benchmark selec- 
tion region in the VIKING Z, Y, J colour-colour plane. The 
filled grey region shows the set of relaxed constraints used in 
the search. Dark grey points represent objects which pass the 
benchmark constraints but have not undergone follow-up. A 
small fraction remain strong candidates and are awaiting ob- 
servation, but most were rejected after probabilistic ranking 
or because they are associated with an optical source in the 
SDSS overlap region. Crosses and open circles show Intt vis- 
ible and Intt drop-out candidates respectively. As is clear 
from the scatter of these points outside of the grey region, 
even the relaxed constraints were not strictly adhered to. 

The Intt drop-out lying at Z - Y ~ 1.25, Y - J ~ 0.7 
is a faint (S/N ~ 6 in Y) low-priority candidate and would 
not normally have been observed but for a nightly period in 
which most high priority candidates were not visible. Given 
its low priority it was not observed in Zntt. Excluding this 
object there remain five strong Intt drop-out candidates 
which were subsequently followed up in Zntt in order to 
confirm a break shortward of the VISTA Y-band. All were 
found to have a combination of colours in Intt, Zntt, Y 
and J consistent with 6.5 < z < 7.0 quasars. Four of these 
candidates lie firmly within the benchmark colour selection 
constraints and a fifth lies on the boundary, three candi- 
dates lie just outside the significance constraints (S/N > 7.3; 
which serves as a reminder of the importance of widening 
the search space in order to consider the colour uncertain- 
ties on objects lying around the selection boundaries). The 
clustering of these candidates in and around the selection re- 
gion provides considerable support for the choice of selection 
constraints. A detailed discussion and analysis of the NTT 
photometry, as well as the spectroscopy of these candidates 
is given by Venemans et al. (2011). 



7 SUMMARY AND FUTURE WORK 

The first public data release of the VIKING survey will 
be delivered to ESQ in mid- to late-2011. Approximately 
75 deg^ is expected to be made public. VIKING is expected 
to contain a significant sample of 6.5 < z < 7.5 quasars, 
the precise number depending sensitively on the unknown 
evolution of the quasar luminosity function beyond z ~ 6. 

Follow-up of the first VIKING quasar candidates was 
conducted in June 2011. Candidates were chosen from ~ 



Given the modest area covered by VIKING at this stage 
(~ 350 deg^), it should be noted that the observation run was 
heavily dictated by the visibility of the patchy VIKING footprint 
throughout the night. Thus the observations presented here do 
not represent an exhaustive list of our highest priority candidates. 



O Undetected in 1^, 
+ Detected in 
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Figure 14. High-z quasar candidates in the VIKING Z, Y, J 
colour plane. The dashed line shows the benchmark colour se- 
lection region. Grey circles show all VIKING candidates falling 
within this region after applying the benchmark morphological 
and photometric constraints described in the main text. A frac- 
tion of these objects are high priority candidates awaiting follow- 
up but most were rejected as result of probabilistic ranking or 
due to optical associations with SDSS sources. Crosses and open 
circles are candidates observed in Intt and Zntt • Open cir- 
cles were undetected in Intt a-nd have IntTi ^ntT; Y, J colours 
consistent with 6.5 < z < 7.0 quasars. Error bars show la uncer- 
tainties. 



350 deg'^ of data available at the time. To conduct a full and 
rigorous search of this data set a selection strategy was re- 
quired in advance of the availability of most science ready 
data products. This made it difficult to develop a robust set 
of quasar selection constraints which optimise both efficiency 
in photometric and spectroscopic follow-up, whilst simulta- 
neously selecting a highly complete sample of quasars. 

To remedy this, we have modelled the cool star compo- 
nent of the VIKING catalogue in the Z, Y, J passbands to 
mimic contamination of quasar colour space by dwarf stars 
of spectral type M, L and T. We have compared this model 
to ~ 1 deg^ of imaging in the VIKING- VIDEO-CFHTLS 
overlap region using photometry from the complementary 
g, i, J, Ks colours as a robust star-galaxy separator. The 
model and data are in good agreement and we use it along 
with a sample of galaxies from the same overlapping data 
set to place constraints on the photometric and morpho- 
logical characteristics of quasar colour space contaminants. 
This allows us to define a set of benchmark photometric 
and morphological selection cuts which will limit follow-up 
candidates to ~ 1 object per 3deg'^, an initial requirement 
of the follow-up campaign. The completeness of this search 
strategy is comparable to similar quasar searches conducted 
both in the optical and the near-IR. 

Follow-up imaging of the first quasar candidates from 
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the NTT provides considerable support for the choice of 
benchmark constraints. Of 44 candidates observed in either 
Intt or both Intt and Zntt, just 5 have colours consis- 
tent with 6.5 < z < 7.5 quasars. All 5 of these sources have 
VIKING colours that place them within our benchmark se- 
lection constraints. 

Several other optical and near-IR surveys are due to ei- 
ther see their first light or to release their first products in 
the near future (e.g. Pan-STARRS, VST-KIDS, VHS). The 
tendency with large surveys is to place priority initially on 
the timely release of a small volume of data from a few well 
studied fields (VIKING being no exception). The method- 
ology described here will provide an equally complementary 
set of methods with which to assess the first data from these 
forthcoming catalogues, allowing benchmark constraints to 
be placed on all rare object searches. 

As surveys and corresponding data volumes begin to 
enter the Peta-byte era, rare object searches employing sim- 
ple colour cuts are becoming cumbersome and inefficient. 
Several authors have already begun to prioritise follow-up 
candidates by combining observables to place them in a well 
defin e d ranking scheme ijRichards et al.l l2009l : lYeche et al.l 
I2OIOI : IBovv et al.ll201ll '). The first such ranking algorithm 
for high-z quasar candidates, uses Bayesian model compar- 
ison techniques to determine the probabili ty that a given 
sourc e has scattered from the quasar locus (jMortlock et al.l 
In principle this methodology does away with colour 
cuts all together retaining and treating information that is 
normally lost or considered qualitatively, in a robust mathe- 
matical framework. The algorithm presented by Mortlock et 
al. requires accurate models of the photometry of cool stars 
and quasars specific to the particular survey it is applied 
to. Ideally the stellar model is constrained via maximum 
likelihood fits to the real data. In the case of VIKING and 
several other surveys this is not possible with the limited 
data expected from initial releases. 

In a forthcoming paper we will present quasar candi- 
dates selected from the VSA by applying the stellar model 
developed here to the Mortlock et al. algorithm. We will 
discuss the success of this approach to modelling the stel- 
lar population by characterising a sample of test set data. 
We will also discuss in detail, the techniques necessary for 
defining the input catalogue from VSA archived photome- 
try. Our hope is that this will provide a starting point from 
which to encourage future independent quasar searches in 
the VIKING data. This shows the diversity of the tools de- 
veloped here in the context of high-z quasar and other rare 
object searches. 
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